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ABSTRACT Bacteriophage X integrates into the chromo- 
some of its Escherichia coli host by means of a site-specific 
recombination between a locus on the phage chromosome 
(phage att site) and a locus on the bacterial chromosome (bac- 
terial att site). The nucleotide sequence of four X att sites altered 
in site-specific recombination has been determined. The int- 
dependent deletions that generated these att sites have one end 
point within the phage att site and extend either to the left or 
to the right. As a result of the new internucleotide bond created 
by deletion formation, these phage have alterations in the 15- 
base-pair common core region. The new DNA sequences 
brought to the att sites by the deletions, designated A for regions 
to the left and A' for regions to the right, do not share any dis- 
cernible homology with their analogous counterparts in the 
phage att site arms, P and P', respectively, or with the bacterial 
att site arms, B and B', respectively. The finding of alterations 
in the 15-base-pair common core region necessitates a reinter- 
pretation of the genetic properties of these att sites in site-spe- 
cific recombination. The structure of these sites in relation to 
their genetic properties can be viewed as being consistent with 
a model in which the only specificity elements in inf-dependent 
site-specific recombination are the common core region, O, and 
the phage arms, P and P\ 

Bacteriophage X is an extremely useful model system for 
studying site-specific recombination, due in a large part to the 
availability of mutants directly affecting the recombination 
process. The mutations can occur either in genes coding for 
proteins required for this process (1) or in the region of the DNA 
(att sites) used as substrates for the reaction (2-5). This report 
describes results that begin to identify those features of the att 
site structure essential for site-specific recombination. 

As first proposed by Campbell (6), upon injection into its host, 
Escherichia coli, the DNA of X can integrate into the bacterial 
genome by a reciprocal recombination event. This reaction 
requires a functional phage gene, int (7-9), host factors (10 and 
11), and reaction sites on the DNA of both phage and bacterium 
designated att P (POP') and attB (BOB'), respectively (12). As 
a result of integration, X is inserted linearly into the bacterial 
genome, and two prophage att sites, one at each end of the 
integrated phage, attL (BOP') and attR (POB'), are formed. 
Excision, the reverse reaction, recombines attL with attR in 
the presence of a functional int gene and an additional phage 
gene, xis (13, 14), to regenerate the original attP and attB sites. 
Genetic evidence has shown that these two reactions occur with 
precision, and that the att sites are not altered by cycles of in- 
tegration and excision (15). 

Both genetic experiments (16) and DNA heteroduplex 
mapping analysis (17) have indicated that attR and attB are 
nonhomologous. Recent DNA sequence analysis has revealed, 
however, a 15-base-pair region of homology shared among attP, 
attB, attL, and attR, designated the common core, O (18). It 
is in this region that the crossover during integration and exci- 
sion is presumed to take place. We would like to know which 
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features enable the att sites to participate in site-specific re- 
combination. In an attempt to answer this question, we have 
analyzed four phages that contain different structural alter- 
ations in their att sites but are still proficient in certain aspects 
of site-specific recombination. These phages all have deletions 
with one end point located within the phage att site (17) and 
whose generation depended upon the presence of a functional 
int gene (3). 

MATERIAL AND METHODS 
Phage. Xfc511c/am, Xi»508c/am, and X£>522c/am were ob- 
tained from J. S. Parkinson (3). Xfc2c/857 was provided by F. 
Blattner. As a source of wild-type DNA, Xc/857S7 was used. 
Phage were grown and their DNA was extracted by using de- 
scribed procedures (19). 

Enzymes. Restriction endonucleases EcoRI, Hindlll, Mbo 
II, and BamHI were isolated by described procedures (20). Hpa 
II, Hha I, Hae III, Hinfl, and Alu I were purchased from New 
England BioLa I >s Assay conditions for these enzymes have been 
described (21). Bacterial alkaline phosphatase was purchased 
from Worthington. Phosphorylation of 5' ends of DNA frag- 
ments was carried out by using polynucleotide kinase from P-L 
Biochemicals or from Boehringer Mannheim. DNA polymerase 
I (Boehringer Mannheim) in conjunction with DNase 
(Worthington) was used for nick translation according to the 
procedure of Rigby et al. (22). 

Gel Fractionation. Restriction fragments were fractionated 
by electrophoresis (on either 1% agarose or polyacrylamide 
gels), visualized, eluted, and prepared for sequencing as de- 
scribed (21). 

DNA Sequencing. The methods of Maxam and Gilbert (23) 
were used for 5'-end-labeling of DNA fragments and the sub- 
sequent nucleotide sequence determination. Sequencing gels 
were routinely 12% polyacrylamide with 8 M urea. For the 
determination of sequences from 150 to >200 bases from the 
5' labeled end, 0.5-mm "thin" gels as described by Sanger and 
Coulson (24) were used. In addition, the gel length was in- 
creased to 70 cm and the run was at >2000 V. 

RESULTS 

Isolation of Restriction Fragments Containing the att 
Sites. Each of the four phages used in this study (Xb2, X&511, 
Xfc508, and X&522) has extensive deletions of DNA. Moreover, 
each deletion has one end point that has been localized within 
or very close to the att site (Fig. 1) by genetic crosses (3) and 
DNA heteroduplex analysis (17, 25). To identify a primary 
restriction fragment containing the att site, restriction digests 
of the deletion phages were compared with those of wild-type 
phage. As expected, each deletion phage lacked one or more 
restriction fragments as a result of the deletion. In addition, each 
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Fig. 1. Physical and genetic map of bacteriophage X. The X chromosome is shown as a linear molecule with various phage genes represented 

by letters. Below are regions of DNA deleted ( ) in the four phages studied: X62, X6511, Xb508, and X6522. Deletion end points are given as 

% X, assuming the cohesive ends of X are at 0 and 100% from left to right. The end points have been fixed by electron microscope DNA heteroduplex 
mapping (25). Locations of EcoRI (t) (26), SamHI (J) (27), and Hindlll (•) (19, 28) restriction enzyme cut sites are noted. 



digest also contained a single unique fragment characteristic 
of that particular deletion phage. This unique fragment should 
contain the "novel joint" or new internucleotide bond created 
by the deletion. Double restriction enzyme digestions with 
EcoRl and HindlH were used to isolate primary fragments 



from Xfo508 and Xfc522, Eco RI and BamUl were used to digest 
Xfc2 and Hindlll and BamUl were used for Xb511. 

Primary fragments were isolated preparatively and then 
subjected to further restriction enzyme analysis to obtain 
fragments small enough for DNA sequencing. A number of 
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Fig. 2. Fine-structure restriction maps of att site containing fragments isolated from X+ (18) and the four deletion phages, X62, X6511, 
X6508 and X6522. Map distances are given as nucleotide pairs from a fixed starting point, O, located at the center of the core (18) with positive 
numbers representing material to the right of att and negative numbers for material to the left of att. * indicates that relative order of these 
restriction sites in this region have not been determined. Labeled DNA strands used for sequence determination («~) are below each map with 
* representing 32 P -labeled 5'-termini and — showing direction of sequence. 
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FIG. 3. Autoradiographs of two sequencing gels, showing both 
strands of the Hinfl/Hinfl fragment spanning the att site of Xh51 1. 
Reaction conditions for cleavages at T, C, and G were those described 
by Maxam and Gilbert (23). Cleavage at A was obtained by heating 
the sample in 1 M NaOH at 90°C for 8 min, followed by the standard 
treatment in 0.5 M piperidine for 30 min at 90°C. Electrophoresis was 
in 10% acrylamide (B) or 12% acrylamide (A). (A) Sequence derived 
from the 5' Hinfl terminus located in the A arm of the AOP' phage. 
The core sequence has been underlined and the numbers represent 
distance in base pairs from the center of the core. Base +1 then is 
actually 54 base pairs from the 5 ' terminus of this fragment. (B) Se- 
quence derived from the complementary strand with the core se- 
quence again underlined. In this case the base labeled +6 is 200 bases 
from the 5' terminus located in the P arm of the phage. 

methods were used to generate the restriction fragment maps 
of the primary fragments shown in Fig. 2. In some cases the 
partial mapping procedure of Smith and Bimstiel (29) was used. 
As an example, the 5' EcoRI end of the EcoRl/BamHl primary 
fragment of Xfc2 was labeled with 32 P. This fragment was then 
digested with various enzymes under conditions such that di- 
gestion does not go to completion. The number and spacing of 
fragments generated by partial digestion as seen by autoradi- 
ography could then be used to construct an order of fragments 
obtained in a complete digest. We were also aided in ordering 
restriction fragments by prior knowledge of the restriction map 
of the wild-type phage att site (18). Note that all of the primary 
fragments contain either a wild-type P or P' arm. 

Nucleotide Sequence Analysis of the Hybrid att Sites. The 
arrows in Fig. 2 indicate the direction in which DNA was se- 
quenced from the ^P-labeled 5'-end by the procedure of 
Maxam and Gilbert (23). Hinfl fragments which spanned the 
att site in all of the phages analyzed were kinase-labeled and 
strand separated, and both strands were sequenced. As an ex- 
ample, the two gels in Fig. 3 show complementary portions of 
the Hinfl fragment spanning the att site in A&511. 



It is worth noting that, although the phages we have used for 
sequencing are of independent origin from those sequenced 
by Landy and Ross (18), our sequence data (Fig. 4) for the P and 
P' arms are in complete agreement. We do not find the dis- 
crepancies noted by Da vies et al. (30) at positions +18, +35, 
and +63 (Fig. 3A) in their sequence analysis of the P' arm. In 
addition, our sequencing data have provided a complement for 
a short stretch of 17 bases at positions +35 to +52 in the P' arm 
(18). 

DISCUSSION 

Although the general model proposed by Campbell (6) for 
site-specific recombination in X has been shown to be correct, 
the detailed mechanism for this process at the molecular level 
is not yet understood. In particular, little is known about the 
specificity encoded by the DNA substrates for this reaction. We 
have reported here the analysis of four mutant att sites whose 
recombination frequencies in site-specific recombination differ 
from the frequency of a wild-type phage att site, POP'. 
Whereas POP' recombines efficiently with BOB', neither the 
att sites of the leftward deletion phages (AOP') nor the right- 
ward deletion phages (POA') recombine efficiently with BOB' 
(3, 31). Despite this difference, these mutants do not represent 
a null phenotype with respect to site-specific recombination, 
because each is capable of recombining efficiently with certain 
att sites other than BOB' (3, 31, 32). In fact, it has been possible 
to characterize the mutants on the basis of their relative ef- 
ficiencies in in<-dependent recombination with several dif- 
ferent att sites. Because of this, the prevailing hypothesis has 
been that the mutants, like the wild-type att sites, contained 
a common core region, O, and that the measurable differences 
were due to different flanking sequences A and A'. 

From our analysis we find that indeed there is an identifiable 
common core sequence in AOP' and POA' att sites, but it is not 
identical to that of wild-type (Fig. 5). Thus, it is not possible at 
this time to attribute the characteristic genetic properties of 
these mutants in site-specific recombination solely to the 
flanking sequences A or A'. 

Comparison of the mutant core sequences with those of the 
wild-type reveals a conserved five-base sequence in the center 
of the core 

-2 0 +2 
T-T-T-A-T. 

The conservation of this region in four different mutant att sites 
is suggestive evidence for it being at least one of the critical 
regions necessary for some aspect of site-specific recombination. 
Our present data do not include a sufficient number of mutant 
att sites to define this critical region with great certainty or 
precision. In two att sites that might be regarded as analogous 
to those described here, the left (AOP') and right (POA') pro- 
phage att sites of a X chromosome inserted into the irpC gene 
of E. coli have a six-base-pair conserved sequence located at the 
center of each of the core regions (G. Christie and T. Piatt, 
personal communication). It is interesting, however, that cal- 
culations based on (i) the number of secondary att sites within 
a segment of the E. coli genome (33) and (ii) the number of 
inf-dependent derived deletions in the nonessential region of 
X (3, 31) predict a five- or six-base-pair recognition sequence 
to account for the apparent specificity of integrative recom- 
bination. This approximation is, of course, based upon the 
simplifying assumption of a random DNA sequence for the 
genomes of E. coli and X. 

It is clear that in a functional sense both the P and P' arms 
cannot be replaced by the corresponding A and A' sequences. 
One may now ask whether it is possible to replace the B and B' 
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~$° "f° P|A*1 -?0 0 p'|v+\ -20 

AGTCTGTTTTTTAT6CAAAATCTAATTTAATATATT6ATATTTATATCAT^TACGTTTCTCGTTCA|BCTTTTTf ATACTA^ 

TCAGACAAAAAATACGTTTTAGATTAAATTATATAACTATAAATATAGTAAAATGCAAAGAGCAAG7 CGAAAAAATATGATl CAACCGTAATATTTTTTCG 

A(xb2) p'lx*] 

TCAGTATGGAAGAGCAACCATTACAGGAAGTTTATCCACAGTTAAA66AACCAACACACAGGGTATT BCTTATTTATACTAA 3TTGGCATTATAAAAAAGC 
ATAGGTGTCAATTTCCTTGGTTGTGTGTCCCATAACGAATAAATATGATl CAACCGTAATATTTTTTCG 

AIXb511) p'(A + ) 
CAGTTAAATATGAAGGTAATTTCTTTTGTGCAAGTCTGACTA*CTTTTTTATACTAASTTGGCATTATAAAAAAGC 
ACTTCCATTAAAGAAAACACGTTCAGACTGAT TGAAAAAATATGATT CAACCGTAATATTTTTTCG 

B B' 

GGAAACTGAAAATGTGTTCACAGGTTGCTCCGGGCTATGAAATAGAAAAATGAATCCGTTGAAGCCT 3CTTTTTTATACTAA CTTGAGCGAAACGGGAAGG 
CCTTTGACTTTTACACAAGTGTCCAACGAGGCCCGATACTTTATCTTTTTACTTA66CAACTTCG6 A|cGAAAAAATATGAT1 SAACTCGCTTTGCCCTTCC 



MA*) 0 +20 P'(X*I +40 +60 +80 

CGTTTCTCGTTCaIgCTTTTTTATACTAA STTGGCATTATAAAAAAGCATTGCTTATCAATTTGTTGCAACGAACAGGTCACTATCAGTCAAAATAAAATCATTATTTGA 
GCAAAGAGCAAGT CGAAAAAATATGATT CAACCGTAATATTTTTTCGTAACGAATAGTTAAACAACGTTGCTTGTCCAGTGATAGTCAGTTTTATTTTAGTAATAAACT 

PIX1 A'(Ab508) 

CGTTTCTCGTTCA SCTTTTTTATGTAAA SAAACAGTAAGATAATACTCAACCCGATGTTTGAGTACGGTCATCATCTG 
GCAAAGAGCAAGT CGAAAAAATACATTT :TTTGTCATTCTATTATGAGTTG66CTACAAACTCATGCCAGTAGTAGACTGTGATGTCTGA 

PlA*j A'|Ab522| Mboll 

CGTTTCTCGTTCA 3CTTTTTTATACCAT CATTTCCAGCTTTTGTGAAAGGGATGCGGCTAACGTATGAAATtCTTCGTCTGTTTCTACT 

GCAAAGAGCAAGT CGAAAAAATATGGTA 3TAAAGGTCGAAAACACTTTCCCTACGCCGATTGCATACTTTAAGAAGCAGACAAAGATGACCATAACCGTGTTTGGACTA 

TCCGTTGAAGCC "CTTTTTTATACTAA'TTGAGCGAAACGGGAAGGTAAAAAGACAAAAAGTTGTTTTTAATACCTTTAAGTGATACCAGATGGCATTGCGCCATCTG 
AGGCAACTTCGGA pGAAAAAAf ATGATt} jAACTCGCTTTGCCCTTCCATTTTTCT6TTTTTCAACAAAAATTATGGAAATTCACTATGGTCTACCGTAACGCGGTAGAC 

Fig. 4. The nucleotide sequence surrounding the common core of the att sites in X62, X6511, X6508, and X6522. For comparison, the wild-type 
sequence of the P and B arms (18) have been placed above and below, respectively, the A arms of X62 and X6511. Likewise, the P' and B' arms 
have been aligned with the two A' arms of X6508 and X6522. The core structures are denoted by boxed regions. Where only the sequence of one 
strand is shown, we wereunable to obtain the complementary strand because of proximity to the 3' terminus. In these regions, sequence was 
confirmed by repeated sequencing of the strand shown. 



arms without loss of function. In the case of the B arm, the an- 
swer is certainly yes. Although Xfc2 and Xfc511 do not recombine 
efficiently with a BOB' att site, they do recombine efficiently 
with a POB' att site (3, 31, 32). In fact, their genetic behavior 
in site-specific recombination is completely analogous to that 
of the BOP' att site; BOP' does not recombine efficiently with 
BOB' but does recombine efficiently with POB' (3, 12, 31). 
From this genetic data it appears that the A arms of Xfo2 and 
A&51 1 are equivalent to the B arm in terms of their contribution 
to inr-mediated site-specific recombination — that is, without 
any apparent homology between the A arms and the B arm. 

For the replacement of B' with A' sequences, the genetic data 
are more variable. In this instance, int-dependent recombi- 
nation between the POA' and BOP' att sites proceeds at l/10th 
to 1 /2 the efficiency of the analogous recombination between 
POB' and BOP' (3, 31). Because the range of measurable re- 



Xb2 (API 
Xb511 (AP) 
Xb508 (PA) 
Xb522 (PA) 



GCTTTTTTATACTAA 
GCTTATTT ATACT AA 
ACTTTTTTAT ACTAA 
GCTTTTTTATGTAAA 
GCTTTTTTATACCAT 



CONSERVED CTT TTTAT A 

Fig. 5. Comparison of the nucleotide sequences of the cores 
from X+, Xb2, X6511, X6508, and X6522. Bases that are common to all 
five cores are designated "conserved." 



combination efficiencies extends over several orders of mag- 
nitude, we do not view this as a very large reduction in effi- 
ciency for the POA' sites. Considering the absence of any ap- 
parent homology between the A' sequences and B' sequences, 
it seems remarkable in fact that the A' arms would function at 
all if the B' arm sequence were essential for site-specific re- 
combination. (The relatively small reduction in recombination 
efficiency might well be explained by the changes found in the 
core structure.) Thus, our tentative interpretation of the results 
reported here in conjunction with prior genetic data on the 
behavior of these mutants in site-specific recombination lead 
us to favor a model similar to one suggested by Shulman et al. 
(5): Site-specific int-dependent recombination requires only 
three DNA elements — the common core region, O, the P arm, 
and the P' arm. It should be emphasized, however, that this 
model cannot be satisfactorily tested without finding, or con- 
structing, att sites that have altered arms flanking a completely 
wild-type core region. 

The interaction of these mutant att sites with purified Int 
protein presents some surprising results. Kikuchi and Nash (34, 
35) have shown that, in a filter-binding assay, purified Int 
protein (in the absence of any other factors) binds to POP', 
BOP', and AOP' (Xi2 and Xb511) with equal efficiency and 
20-30 times more efficiently than to BOB', POB', and POA' 
(Xi517). [According to Davis and Parkinson (17), X&517 is 
probably the same as X&508 sequenced in this paper.] These 
data argue strongly in favor of an important role for the P' arm 
in terms of Int-binding. An important role for P' was also in- 
ferred from the structure of the wild-type phage att site; an 
11-base-pair sequence of the common core region is present in 
the P' arm as a perfect inverted repeat (18). It is clear from the 
sequences reported here that phages that have maintained the 
inverted repeat in P' (Xb2 and X&511) bind Int more efficiently 
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than does X&508 which no longer has the P' inverted repeat. 
Furthermore, the single base changes in the core regions of A&2 
and X&511 do not seem to reduce the efficiency with which 
either of these bind Int protein in the filter assay. It should be 
pointed out that integrative recombination is a complex reaction 
and, therefore, filter binding assays of purified Int may reflect 
only one aspect of the role of this protein in the overall pro- 
cess. 
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Spontaneous deletion mutants of the temperate lactococcal bacteriophage BK5-T were obtained when the 
phage was grown vegetatively on the indicator strain Lactococcus lactis subsp. cremoris H2. One deletion mutant 
was unable to form stable lysogens, and analysis of this mutant led to the identification of the BK5-T attP site 
and the integrase gene (int). The core sequences of the BK5-T attP and host attB regions are conserved in a 
number of lactococcal phages and L. lactis strains. 



BK5-T is a temperate lactococcal bacteriophage that can be 
induced from the lysogen Lactococcus lactis subsp. cremoris 
BK5 by mitomycin treatment (9). Under certain propagation 
conditions, BK5-T spontaneously loses the ability to lysog- 
enize, possibly as a result of one or more deletion events (7). 
The availability of detailed information concerning the BK5-T 
genome (3, 11) prompted us to seek deletion mutants of 
BK5-T that had lost the ability to form stable lysogens as a 
strategy for identifying genes essential for the establishment 
and/or maintenance of lysogeny. Characterization of one of 
these mutants enabled us to identify and sequence the phage 
and host attachment sites, attP and attB, respectively. Deletion 
of a 536-codon open reading frame (ORF) and tandemly re- 
peated segments within a 1,904-codon ORF in BK5-T did not 
affect the frequency of lysogeny, thereby eliminating the pos- 
sibility that these gene products are required for lysogeny. 

Deletions within the BK5-T genome during lytic propaga- 
tion. The indicator strain L. lactis H2 (9) was infected with 
BK5-T.H2L (BK5-T isolated after induction of the lysogen L. 
lactis H2L [3]) and incubated until the culture lysed. The cul- 
ture was centrifuged (5,000 X g) for 10 min, and the cell-free 
lysate was added to an appropriate volume of uninfected cells. 
This lysis/infection cycle was repeated 20 times, and the result- 
ing phage was designated BK5-T.H2cyc20. Analysis of EcoRI- 
Pstl digests of BK5-T.H2L DNA and BK5-T.H2cyc20 DNA, 
together with hybridization studies with Eco Rl-a and Eco Rl-b 
(11), showed that at least nine fragments (in addition to the 
fragments containing cos) in the BK5-T.F£2cyc20 DNA digest 
were present in submolar amounts (Fig. 1). Five of these frag- 
ments were produced by specific deletions within £coRI-b (11) 
(Fig. IB, lane 2). £coRI-b contains four perfect tandem re- 
peats of 468 bp and a fifth incomplete tandem repeat within 
the large ORF1904 (3). The 7.4-kbp fragment that hybridized 
to £coRI-b was the full-length £coRI-b(Pl) fragment (11), 
while the sizes of four other submolar fragments (6.9, 6.4, 6.0, 
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and 5.5 kbp) were consistent with their being produced by the 
loss of one, two, three, or four of the 468-bp tandem repeats, 
respectively. The other four submolar fragments (5.2, 4.3, 3.2, 
and 2.1 kbp in size) resulted from deletions within £coRI-a 
(11) (Fig. 1C, lane 2). 

Characterization of BK5-T deletion mutants. The presence 
of many submolar fragments in restriction digests of BK5- 
T.H2cyc20 DNA suggested that the BK5-T.H2cyc20 prepara- 
tion comprised a mixed population. To characterize the dele- 
tions more precisely, three deletion mutants of BK5-T (BK5- 
T.H2A8, BK5-T.H2A10, and BK5-T.H2A11) were randomly 
selected from single plaques of BK5-T.H2cyc20. Deletions 
within BK5-T.II2A8, BK5-T.I I2A10, and BK5-T.I I2A 1 1 DNA 
were located by restriction mapping, Southern hybridization 
(Fig. 1), and DNA sequencing. Each mutant contained a de- 
letion within the tandem repeat region of ORF1904 (Fig. 2) 
(3), while BK5-T.H2A8 and BK5-T.H2A10 also contained de- 
letions within £coRI-a. The deletion within ORF1904 could be 
explained by a single crossover recombination event between 
two 5-bp "core sequences" of ACGGA situated at 4198 and 
6070 bp (3) in BK5-T.H2A11 DNA or between two 8-bp ho- 
mologous regions situated at 4222 and 6094 bp (3) in BK5- 
T.H2A8 DNA (Fig. 2). Each of these deletions removed 1,872 
bp of DNA, equivalent to four of the 468-bp tandem repeats 
identified previously (3). These deletions shortened ORF1904 
by 624 codons without changing the reading phase. BK5- 
T.H2A10 phage comprised a mixed population of phages car- 
rying deletions within ORF1904 corresponding to the loss of 
two, three, or four 468-bp repeats. The precise endpoints of 
these deletions could not be determined. Each phage deletion 
mutant was able to propagate vegetatively on L. lactis H2, 
yielding slightly larger plaques (1 to 1.5 mm) than did BK5- 
T.H2L (0.5 to 1 mm). 

FASTA (15) comparison of the ORF1904 amino acid se- 
quence with all protein sequences in GenBank showed signif- 
icant homology with a number of proteins from the collagen 
family. This homology was centered around the repeated Gly- 
X-Y motif, found 64 times in ORF1904. FASTA analysis also 
indicated similarity between ORF1904 and an unidentified 
ORF (ORF35) from the lactococcal bacteriophage bIL67 (19) 
and less similarity to the lactococcal lytic bacteriophage US3 
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IN;. I. I lccliophoiclicscp.il. moil of/ HI /VI icsliiclion digests ol 'UNA from lifs> l.ll.'l .mil dclct ion mutants. ( A ] l;ls> I I >.\ A \\ .is di.n sk d « ii h holh / , oKI 
and /'.v/I, and the rcsultim; Ii.ibiik nl ■ \u ic clccli ophoicscd ihioueh .1 0..S' , (w 1 \ol ) aitatosc ucl and si, lined w ilh clhidium bromide. The DNA was from 15K5-T.II2I. 
(lane I), lilO-T.I I2eve20 (lane 2), 15K.a-l.H2Al I (lane 3), 15Io- 1.1 12AS (lane 4), I Uo- I.I I2AI (1 (lane N, and A UNA digested with /VI (lane (.1. (IS) Soulhem hlol 
of the gel shown in panel A altci h\ hi idi/.al ion w ilh 'P-labclcd /.™RI-b ( 1 1 ). (( ') Soulhern hlol of the gel show n in panel Aaflei h\ hi idi/al ion w ilh ' P-labeled£coRI-a 
(II). Numbers on [he 1 ighl ol each panel indicate the si/es (in kilohase pails) of liagmcnls liom a /VI digest of ,\ I >NA, and allows show positions of fragments present 
in the digests of mutant phages which were not present in the digest ol'lifo- I.I 121 . Numbers on the left of panel A indicate [he si/es (in kilohase pairs) of BK5-T.H2L 
fragments. 



lytic enzyme (16) and the Bacillus subtilis xylose isomerase 
(22). Analysis by COMPARE (18) indicated similarity between 
ORF1904 and numerous proteins involved in binding and/or 
degradation of cell wall glycoproteins. These sequence similar- 
ities and the observation that a number of cell wall-lytic en- 



zymes contain repeated sequence motifs (5, 8, 10, 12) suggest 
that ORF1904 may be involved in cell lysis during the lytic 
cycle of BK5-T or in cell wall hydrolysis to enable phage DNA 
injection. 

The genomes of BK5-T.H2A8 and BK5-T.H2A10 also con- 



EcoRl-e ,£coRI-f, EcoRI-d 



_| BK5-T.H2A11 
_| BK5-T.H2A8 |_ 



BK5-T.H2A10 | | 

0 1 2 3 4 5 6 7 8 9 101112 kbp 
I I I I I I I I I I I I I 

FIG. 2. Physical map of the BK5-T genome. (A) EcoSl (E) restriction map of the BK5-T genome. Restriction fragments arc named as described previously (II). 
The ends of the m i| | I I UN A (3). (B) Physical mrp it I I | II I 11/ I III )/ I 

and adjacent numbers show die numbei of codons in each ORF from the first ATG codon to the first stop codon. ORF'410 is open at the 5' end. The positions of 
the tandem duplications arc indicated In solid and open rectangles. anP is the phage attachment site. The positions of the deletions within the genomes of 
. . • " ec l. Since BK5-T.II2AI0 was a mixed population ol phages earning various 
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BK5-T attP CATCTATCCTTTACTATGTAGTTCTTCATGAACGATAATAAACAATAATAG 

L. lactis H2L attR CATCTATCCTTTACTATGTAGTTCTTCATGGGAGCTCAAGAAGAATAAAAG 

L. lactis H2L attL AAAGAATTGCAAACCTTGTTATTCTTCATGAACGATAATAAACAATAATAG 

L. lactis H2 attfl AAAGAATTGCAAACCTTGTTATTCTTCATGGGAGCTCAAGAAGAATAAAAG 

l'Ki. 3. Nucleotide sequence of the «» regions of lilo-l", /.. fcrf/S 112, and /.. luvtis 1 121.. (A) Ph\sical m.ip of [lie lSIO-1 prophage. Bio- 1 is designated by Ihe open 
rectangle, while die /,. lactis cliroiiiosoinal DNA is indicak d h\ ihv hoi i/onlal line, i he positions of oligonucleotides used to amplify the cut I. nmlallR regions are shown 
by arrows, with the arrowheads designating the 3' end of each oligonucleotide. (I!| Nucleotide sequences of a:: I'. auR, mil., and anil. I he designation at an I. and attR 
was in accordance w ith the s\stein of l.illehaug and I'.iikeland (131. file nucleotide sequence of each legion w as detei mined as desci ihed in the text. I he 9-nucleolide 
core sequence 5'-TTCTTCATG-3' present in all att regions is shown in boldface type. The sequence of the attL, attR, and attB regions was determined in only one 



tained deletions within EcoRl-a (Fig. 2). These deletions re- 
sulted from a single-crossover recombination event between 
two 11-bp core sequences of TTTTTTTGTTT situated at 8677 
and 10701 bp in BK5-T.H2A8 DNA (3) (Fig. 2) or between two 
6-bp core sequences of GTGTTT situated at 8674 and 11737 
bp in BK5-T.H2A10 DNA (3) (Fig. 2). Both of these deletions 
removed ORF536, while the BK5-T.H2A10 deletion also re- 
moved the ORF374-to-ORF536 intergenic region and 256 
codons from the C-terminal end of ORF374 (Fig. 2). 

The phage deletion mutants were tested for their ability to 
form lysogens onL. lactis H2. BK5-T.H2A11 and BK5-T.H2A8 
remained lysogenic, but BK5-T.H2A10 was not. This indicated 
that neither the full-length ORF1904 nor ORF536 (Fig. 2) is 
essential for vegetative growth or the formation and mainte- 
nance of stable lysogens. Coliphage \ also contains genes that 
encode products that are not essential for normal propagation 
(6). The function of the ORF536 protein is unknown, and its 
amino acid sequence exhibits no significant homology with any 
prokaryotic proteins in the GenBank database. The inability of 
BK5-T.H2A10 to form lysogens indicates that ORF374 and/or 
the ORF536-to-ORF374 intergenic DNA is essential for the 
establishment and/or maintenance of lysogeny. 

Localization of the BK5-T attachment site. Since BK5- 
T.H2A10 contained deletions within the EcoRl a fragment, 
previously shown to contain attP (11), it was decided to locate 
attP more precisely to determine whether loss of this feature 
was related to the nonlysogenic nature of BK5-T.H2A10. Sub- 
fragments of BK5-T £coRI-a were used to probe Southern 
blots of £coRI digests of chromosomal DNA from the BK5-T 
lysogens L. lactis BK5 and H2L (data not shown). BK5-T 
subfragments spanning attP would hybridize to two chromo- 
somal fragments, whereas subfragments which did not contain 
attP would hybridize to only one fragment. By using this ap- 
proach, attP was localized between 10769 and 10999 bp (3) and 
the phage/host junctions were shown to be in chromosomal 
Xbal fragments of 90 and 18 kbp in L. lactis BK5 and H2L 
(data not shown). 

The sequences of these phage/host junctions (attL and attR) 
in L. lactis H2L were then determined (Fig. 3B). For attL, this 



was done by sequencing a 900-bp PCR fragment, obtained by 
inverse PCR (14) with JB6 and JB7 as primers and a ligated 
Sspl digest of the gel-purified 9d-kbp Xbal chromosomal frag- 
ment as the template. The obtained sequence revealed 100% 
sequence identity between the 21 bp of host DNA adjacent to 
attL in BK5-T and in 4>LC3, another L. lactis temperate phage 
(13). The existence of this identity and of the 97% identity 
between the 1,627 bp of DNA surrounding attP of BK5-T and 
<}>LC3 (13) enabled us to use the nucleotide sequence sur- 
rounding attB in the <f>LC3 lysogen L. lactis IMN-C18 (13) to 
design PCR primers that amplified L. lactis H2 attB (JB48 and 
JB49 [Table 1]) and L. lactis H2L attR (JB8 and JB49 [Table 
1]) and provided DNA for sequencing. 

Comparison of the DNA sequences of the attP, attL, attR, 
and ullli regions (fig. 3Bj identified a common 9-bp core 
sequence, 5'-TTCTTCATG-3' (bases 10882 to 10874) (3), 
within which recombination between BK5-T and the host ge- 
nome is likely to occur. Thus, BK5-T attP is located in a region 
of the BK5-T genome that was deleted in BK5-T.H2A10 but 
not in BK5-T.H2A11 or BK5-T.H2A8 (Fig. 2). The deletion of 
attP in BK5-T.H2A10 provides an explanation for the inability 
of this phage to form stable lysogens. This observation is of 
particular interest in an industrial context, since to our knowl- 
edge it is the first demonstration of spontaneous mutations 



I Mil I I. Oligonucleotides used in this investigation" 





Oligonucleotide sequence (5'-3') 


Region of BK5-T* 


JB6 


GATCATTAGGAATACTCCCC 


10012-9994 


JB7 


GATCGACATGGGAGAAGGTAAAGG 


10259-10281 


JBS 


CACACAGCAAACCTATATCC 


11051-11032 


.JI54N 


TGTTAAAGCAGGAATCAAAGG 


Comp c 


JB49 


AATACCTAAGCACACGAAGGTT 





"Oligonucleotides were synthesized in an Applied Biosystems model 381 
DNA synthesizer. 

Sequenci numbers lelei to the nucleotide sequence ol ISfo-T as determined 
previously (3). 

c Comp, complementary to /.. tanh I YIN -( ' I 3 genomic DNA (13). 
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^ CAACAAAA^ TACAACAAATCATTAAAAATCAAGAATAAAGCCATTATCTAAAGCACATCTAT 10897 

lCGATAATAAACAATAATAGAATAAATATAAAATAAAAACGGCTTAATATAGCCGTTTTTCCTATTTTTAAAACAACTAAAAAATATAG AAAT 10777 
Rl' R5 Rl 

AAACTCGTACAACAAAATTACAACA^CATTATGTTTCTGATAG?AT^ 10657 

GTTGT AAAGAftAAA CGCCCCGGAGGGCGAGGAkTTTCAAAG^ 10417 



?GTTGATAAGGTTCTTTTATATTCCTTCAGTTATTATACTGTAGCTCATAACTATTTTTCTATAT 102 97 

FIG. 4. Nucleotide sequence surrounding BK5-1" ulll'. Ihe core mil' sequence is boxed, and repealed sequences (Rl, R5 ) and inverted repeats (R2, R3, and R4) 
surrounding alii', which ran he invoked in protein binding ncccssaiy loi phage inlcgialion, aie idem i lie d In ai ions aho\e the sequence with arrowheads to designate 
polarity. The 16 ( '-terminal amino acids of ORF374 (hit ) are shown in singlc-lcttei code above I ho sequence, with * designating the stop codon of the ORF. The core 
sequence involved in Ihe deletion within the phage genome Iik5-1'.H2A8 is indicated In periods and labeled AS. Sequence numbers are as presented previously (3); 
however, the complementary strand is shown to allow cas\ eompai ison with the sequences surrounding the MX '3 and l'uc2009 mil' legions (13, 20). 



resulting in a lytic phenotype in a temperate lactococcal bac- 
teriophage. These lytic phages are not virulent, because they 
cannot infect BK5-T lysogens and thus would grow only on 
strains which did not express the BK5-T repressor protein or a 
functional equivalent. 

BK5-T attP is identical to attP of the lactococcal bacterio- 
phages c|>LC3 and Tuc2009 (13, 20). Moreover, L. lactis H2 
attB and the 21 bases on either side are identical to the corre- 
sponding bases in the L. lactis indicator strains IMN-C18 
OLC3), UC509 (Tuc2009), and UC506 (Tuc2009) and differ 
by 1 base from that of MG1363 (Tuc2009) (13, 20). Five 
(TTCTT) of the nine bases of the BK5-T attP core sequence 
are identical to bases present in the 16-bp core sequence of the 
Lactobacillus gasseri phage 4>adh attP (17). There was no ap- 
parent homology between BK5-T attP and attP of the temper- 
ate lactococcal phage TP901-1 (4). This indicates that there are 
at least two classes of integration system in temperate lacto- 
coccal phages. 

The nucleotide sequence surrounding the BK5-T attP core 
sequence contained a number of repeated and/or palindromic 
sequences (Fig. 4). These regions of DNA may be important in 
binding integrase or an L. lactis IHF homolog. Identical se- 
quences also surround the cpLC3 and Tuc2009 attP sites (13, 
20). Because the deletion mutant BK5-T.H2A8 was able to 
form stable lysogens, only the repeated sequences at positions 
>10701 bp (Fig. 4) can be essential for phage integration. 

Deduced amino acid sequence of the BK5-T integrase. 
FASTA comparison of the deduced amino acid sequence of 
ORF374, which is adjacent to attP (Fig. 2), with all GenBank 
proteins revealed significant homology with a number of site- 
specific recombinases. In particular, ORF374 shared 99.4 and 
98.7% identity with the deduced amino acid sequences of the 
integrase proteins from the Tuc2009 and cj>LC3 phages, respec- 
tively (13, 20). All of these lactococcal phage in legrase proteins 
contain the highly conserved residues of site -specific recombi- 
nases of the \ integrase family (1). 

Overall homology between BK5-T, (bLC3, and Tuc2009. 
Comparison of the nucleotide sequence surrounding int and 
attP in BK5-T (bases 12212 to 10341) and Tuc2009 (bases 1 to 
1872 [20]) identified a region of 1,632 bp including int and attP 
that showed 97% identity at the nucleotide level. Preceding 
this homologous region, the nucleotide sequences diverge and 
the deduced amino acid sequence of ORF536, the adjacent 
gene in BK5-T, shows no homology with the protein encoded 
by the adjacent gene in Tuc2009. It is not possible at this stage 



to compare sequences at the other end of the 1,632-bp homol- 
ogous region. However, a similar patient of homologous and 
divergent regions is seen within the putative cl genes of the two 
phages (2, 21). This is suggestive of a common cassette con- 
taining the integration region (attP and int) and possibly cl in 
BK5-T, Tuc2009, and 4>LC3. Despite these similarities in the 
integration region, DNA from d>LC3 hybridizes with the type 
phage P335 whereas BK5-T DNA does not (4). Moreover, 
phage TP901-1, which contains an integration system different 
from those of BK5-T, ((>LC3, and Tuc2009, also belongs to the 
P335 group of phages (4). Two different types of DNA-pack- 
aging systems have also been identified in temperate lactococ- 
cal bacteriophages. The genomes of BK5-T and c|>LC3 contains 
cohesive ends, whereas Tuc2009 and TP901-1 contain a pac 
site and package DNA into the phage heads by a headful 
mechanism. Further comparison of nucleotide sequence data 
from these phages is necessary to clarify the evolutionary and 
functional relationships between them. 
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In toxigenic conversion of Corynebacterium diphtheriae C7, P bacteriophage DNA integrates into either of 
two chromosomal attachment sites, attBl or attB2. These attB sites share a 96-base-pair sequence with the attP 
sites of 3-related phages. The distribution of ottB-related sites in other species of Corynebacterium was assessed 
by hybridization with a DNA probe containing both attB sites of the C7 strain and a second DNA probe 
containing the attP site of a P-related phage. All but one of the 15 C. diphtheriae strains tested, regardless of 
origin or colonial type, contained at least two BamHl fragments that hybridized strongly to both of these probes 
under conditions of high stringency. Strains of C. ulcerans and C. pseudotuberculosis, species in which 
conversion to toxinogeny has also been demonstrated, also had one or two hybridizing BamHl fragments. The 
functionality of these sites as integration sites was demonstrated by isolating lysogens of all three species 
following single infection with one or more 3-related phages. As predicted, following lysogenization one of the 
DNA fragments that had exhibited homology with the attBl-attB2 probe was replaced by two hybridizing 
fragments. Other species of Corynebacterium, including pathogens and nonpathogens from animals, plant 
pathogens, and soil isolates also carried at least one BamHl fragment that hybridized with the attBl -attB 2 and 
attP probes. The data indicate that sequences homologous to the p phage integration sites in C. diphtheriae have 
been conserved in members of the genus Corynebacterium. 



Nontoxinogenic strains of Corynebacterium diphtheriae 
are converted to toxinogeny by members of the p family of 
corynebacteriophage (6). In addition diphtheria toxin- 
positive strains of C. ulcerans and C. pseudotuberculosis 
may be produced by phage conversion or may already carry 
tox+ p-related phages (1, 5, 6, 9). Genetic evidence and 
Southern hybridization experiments suggest that converting 
phages integrate into the bacterial chromosome by a process 
analogous to lambda lysogenization of Escherichia coli (2,8, 
10). Presumably, site-specific recombination occurs between 
a phage attachment site (attP) and a bacterial attachment site 
(attB). It has been shown that the C. diphtheriae genome 
contains two attB sites, attBl and attBl (13), and recently 
Rappuoli and Ratti (14) cloned and characterized the region 
of the C. diphtheriae C7 chromosome containing these sites. 
The attB sites are 2.25 kilobases apart but share a 96-base 
pair (bp) sequence which is also found in the attP sites of the 
closely related p, y, and oj phages [1, 2, 12; G. Ratti and R. 
Rappuoli, J. Cell. Biochem., 7B(Suppl.):155, 1983; personal 
communication]. Since lysogenic conversion of various 
strains of C. diphtheriae, C. ulcerans, and C. pseudotuber- 
culosis requires integration of a p-related phage, it was of 
interest to determine whether members of all three species 
contained sites similar to the attB sites of C7. In the present 
study, using attB and attP probes, sites homologous to those 
in C7 were detected in all strains of the three species 
examined. They were also detected in other pathogenic and 
nonpathogenic Corynebacterium spp. not presently known 
to be susceptible to or carriers of p-related phages. 

MATERIALS AND METHODS 

Bacteria, phages, and media. All of the bacterial strains 
used were from our culture collection. The strains are 
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mentioned specifically in the text or in figures, and their 
origins are listed in Table 1. In addition, phage 782-carrying 
lysogens of C. ulcerans 690, 712, and 872 and C. 
pseudotuberculosis 766 were isolated, as well as a C7 
lysogen of phage 76 (9) and a C. renale 412 lysogen of phage 
RP28 (7). All of the other lysogens used and their sources 
and the sources of C. diphtheriae phages p, 7, ir, a>, 8, k, p, 
and 782 and C. ulcerans phages 731, 761, 876, a, and h were 
previously described (1, 3). Phage 782 is a tox° p-related 
phage, i.e., one that fails to hybridize with probes specific 
for the diphtheria toxin gene tox. 

Bacteria were grown in tryptose-yeast extract broth con- 
taining tryptose (10 g), yeast extract (5 g), and NaCl (5 g) per 
liter or in heart infusion broth (Difco Laboratories, Detroit, 
Mich.). The agar media contained 1.5% agar per liter. 

DNA probes. The plasmid A634 contains the attBl and 
attB2 region of the C. diphtheriae C7 chromosome in a pUC8 
vector (14). This cloned insert, which will be referred to as 
the attBl -attB2 region and probe is diagramed in Fig. 1. As 
will be seen, this probe is probably not exclusively specific 
for arrfi-related sites; nevertheless, it proved useful in 
screening for such sites. An Hindi fragment of A634 that 
served as a probe to test for DNA flanking the attB sites and 
an attP probe obtained by digesting plasmid T30 DNA with 
AccI and EcoR\ are also diagramed. Plasmid T30 was 
isolated at the Sclavo Research Center, Siena, Italy, which 
supplied the DNA for our probe. It contains the attP-tox 
region of co phage. It has been shown that to phage is closely 
related to converting phage p (12). They have homologous 
attP sites and integrate into the same attB sites (13). The 
diphtheria toxin probe specific for the B fragment coding 
region was previously described (4). Bacterial and plasmid 
DNAs were extracted as previously described (4). 

Analytical procedures. The methods used for restriction 
enzyme digests, agarose gel electrophoresis, nick transla- 
tion, dot blot and Southern hybridizations, autoradiography, 
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TABLE 1. Bacterial si 



Stra 



C7 (770) 

51013 (8026) 

51014 (8028) 

51015 (8032) 

51016 (19409/3984) 
S601 



"C. diphtheriae 

(belfanti)" 
C. ulcerans (12) 



C. pseudotuberculosis 



C. pseudodiphtheriticum 



690 (9015) 

712 (51169) 
740 (1613/50) 
751 (378) 
755 (9304) 
872 (A238) 
S1019 (19410) 

766 (992) 
769 (21) 
771 (1113) 

411 (28) 

412 (8) 

51001 (9775) 

51002 (231) 



51017 (764) 

51018 (10673) 
S1020 (7715) 
S1023 (363) 
S1046 (6939) 
S1080 (13032) 



V. J. Freeman; 2 
ATCC 
ATCC 
ATCC 

ATCC; NCTC 
Seattle, Wash.; 
16 



Romania; 9 

ATCC; Albany, 

N.Y.; 5 
Albany, N.Y.; 5 
Norway; 5 
Wales; 16 
Romania; 9 
Romania; 9 
ATCC 

Romania; 9 
Romania; 9 
Romania; 9 



NCTC 

NCTC 
ATCC 
NCPPB 
ATCC 
ATCC 



" The strain designations include our stock number followed in parentheses 
by the stock number of the soure laboratory. 

h All strains listed were positive in hybridization tests with the auBt-aitB2 
probe. Similarly, all strains tested with the atlP probe, which included all of 
the strains listed except S1013, S1014, 755, S1019, and 771, were also positive 
with that probe. 

'■ ATCC, American Type Culture Collection, Rockville, Md.; NCTC, 
National Culture Type Collection, London. England; NCPPB, National 
Collection of Plant Pathogenic Bacteria, Harpenden, England. 

d The numbers in parentheses immediately following the species name 
indicate the number of strains tested by hybridization with the attBI-aiiB2 
probe in addition to those listed. All of the strains tested were positive with 
this probe. 



fragment was consistently used as a site of integration, we 
examined C7 lysogens carrying other phages. Integration 
was detected by the disappearance of one and the appear- 
ance of two new fragments hybridizing with the attBl-attB2 
probe after lysogenization of CI. Using this criterion, we 
found that the (3-related tox + phages tr, 8, and 76 (data not 
shown) and the (3-related tox 0 phage 782 (see Fig. 3) all 
integrated into a BamWl fragment that hybridized with the 
probe, whereas phages k and p, which are unrelated to (3 or 
to each other, did not (1). Thus, all (3-related phages tested 
integrated preferentially into the region denned by the attBl- 
attB2 probe. Though these phages displayed a preference for 
attB2 in our study, this was not the case in a previous study 
in which the same strain of C. diphtheriae was used (13). 

Fifteen strains of C. diphtheriae, including representatives 
of the three colonial types and the subspecies of "C. 
diphtheriae {belfanti) ," were next tested for the presence of 
DNA homologous to the attBI-attB2 probe. The strains 
chosen varied in the time and place of their isolation, DNA 
restriction pattern, and presence of the tox gene or (3-related 
DNA. Of the 15 tested, 6 of which are listed in Table 1 and 
shown in Fig. 2, all but one (lane B) contained at least two 
BamYW fragments which hybridized strongly with the attBI- 
attB2 probe. One of these fragments comigrated with the C7 
u«B2-containing fragment, whereas the size of the second 
varied (Fig. 2). In strains known to carry (3 phage-related 
DNA, a third hybridizing band was observed (Fig. 2, lanes C 
and D). This would be expected if the phage integrated at 
one of the attB sites. As expected, the third band was weak 
when the phage inserted into the attBl site since the probe 
extends only a short distance to one side of that site (Fig. 1). 
Presence of attBl -attB2 in strains of C. ulcerans and C. 

. In addition to C. diphtheriae, strains of 



(A) C diphthe riae C7 



QttB2 

EBE \ B 



(ott B1/B2) 
(Hindi) 



\l s , , L B 5.3 kb 
J 3.5 kb 

0.7kb 



attP 
IB? c 



and isolation of DNA fragments from agarose gels were 
previously described (2, 4). Stringency conditions for all 
hybridizations permitted a 10 to 12% bp mismatch. 

All bacterial strains were initially screened for the pres- 
ence of attB sites in dot blot hybridizations with the attBl- 
attB2 probe. Southern blots were then performed on restric- 
tion enzyme digests of the DNAs of dot blot-positive strains. 

RESULTS 

Presence of attBl -attBl in strains of C. diphtheriae. It was 
previously shown that tox + (3 and related <d phage and tox~ 
7 phage integrated into the a»B/-a//B2-containing region of 
C. diphtheriae CI (13, 14). To determine whether this CI 



0.5kb 

FIG. 1. DNA probes. (A) Restriction map of the region of the C. 
diphtheriae C7 chromosome containing attBl and attB2 (14). Plas- 
mid A634 contains both attB sites on an EcoR\ fragment cloned into 
pUC8, which is referred to as the attBI-attB2 probe. A 0.7-kilobase 
HincU fragment called the Hinc II probe was used to probe for DNA 
homologous to sequences adjacent to the attB sites. (B) Restriction 
map of the attP region of phage w cloned in pUC8 to form plasmid 
T30. This insert also contains the carboxyl end of the tox gene for 
diphtheria toxin. An Accl-EcoKl fragment of T30, which contains 
attP but not tox, served as the attP probe. Abbreviations: A, Accl; 
B, BamHU C, ClaX; E. EtoXXX; H, HincU: K, KpnX; S, Sail; kb, 
kilobases. 
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C. ulcerans and C. pseudotuberculosis can also be converted 
to toxinogeny by p-related tox + phages (6, 9). To examine 
these species for sites homologous to the attB sites in C7, 
Southern blots of BamHl digests of 18 strains of C. ulcerans 
and 6 of C. pseudotuberculosis were hybridized with the 
attBl-attB2 probe. Hybridizations with a representative 
sample of strains of each species (listed in Table 1) are given 
in Fig. 3. The majority of C. ulcerans, such as 712 (lane C), 
had one hybridizing fragment; however, tox + strains 690 
(lane E) and 740 and 751 (data not shown) had two hybrid- 
izing fragments. Strain 751, as well as four other strains with 
two hybridizing fragments, all contained p-related DNA in 
excess of the a/rfl-related sites and tox, whereas strains 690 
and 740 did not (6). Five of the C. pseudotuberculosis strains 
had two hybridizing BamHl fragments and the sixth, tox* 
strain 766 (lane G), had three. Overall, strains of both 
species, regardless of their origin or the presence of the tox 
gene or p-related DNA in their genomes, had one or more 
BamHl fragments that hybridized with the probe. The size of 
the hybridizing fragment(s) varied, except in several cases in 
which strains with similar overall DNA restriction patterns 
had similar-sized fragments. In all cases, the degree of 
hybridization with DNAs from C. ulcerans and C. 
pseudotuberculosis was significantly weaker than with 
DNAs from strains of C. diphtheriae. Nevertheless, the 
results suggested that these species contained attB sites 
homologous to those in C. diphtheriae. 

If the a//fl/-a//B2-hybridizing fragments contain func- 
tional phage integration sites, then on lysogenization with (J 
or p-related phage two new hybridizing fragments should 
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FIG. 2. Detection of attBl-attB2 homology in C. diphtheriae. 
Purified bacterial DNA was digested with BamHl and electropho- 
resed in 1% agarose. A Southern blot was made, hybridized with 
32 P-labeled A634 DNA, and autoradiographed. Lanes: A, C7 mitis; 
B, S1013 mitis; C, S1014 gravis; D, S1016 gravis; E, S1015 
intermedius; F, S601 mitis. All strains contain 0 phage-related DNA 
except C7 and S601. The arrows identify the attBl- and attB2- 
containing fragments of C7 in lane A. 
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FIG. 3. Detection of attBl -at tB2 homology in C. ulcerans and C. 
pseudotuberculosis. Purified bacterial DNA digested with BamHl 
was processed as described in Fig. 2. C. diphtheriae (lanes): A, C7; 
B, C7(782). C. ulcerans (lanes): C, 712; D, 712(782); E, 690; F, 
690(782). C. pseudotuberculosis (lanes): G, 766; H, 766(782). 



appear while one disappears. We tested this prediction in a 
number of instances with strains sensitive to p-related 
phages. When C. ulcerans 112 was lysogenized with C. 
diphtheriae phage 782, the result was as predicted for 
integration (Fig. 3, lanes C and D), as were those obtained 
when strain 712 was lysogenized with the p-related tox + C. 
ulcerans phage 731, 761, or 876. Similarly, the attBl-attB2- 
hybridizing fragments provided integration sites when C. 
ulcerans 690 was lysogenized by phage 782 (Fig. 3, lanes E 
and F), strain 755 was lysogenized by phage o>, and strain 872 
was lysogenized by C. ulcerans phages 731, 761, and h and 
C. diphtheriae phage 782. Surprisingly, when p-related 
phage 782 lysogenized C. ulcerans 740 and 751, integration 
into the attBl-attB2 hybridizing fragment was not detected. 
Not surprisingly, C. ulcerans phage a, which is unrelated to 
P, did not integrate into the attBl-attB2 region of either 
strain 712 or 872. The attfl/-attfl2-hybridizing fragments of 
C. pseudotuberculosis 766, 769, and 771 also served as 
integration sites for p-related phages 782, 76, and to, respec- 
tively (Fig. 3, lanes G and H). Thus, in all three species, 
attBI-attB2-related regions contained functional attB sites 
for members of the p family of phage. 

Hybridization of C. diphtheriae, C. ulcerans, and C. 
pseudotuberculosis with an attP probe. The attBl-attB2 probe 
is probably not exclusively specific for attB, i.e., the se- 
quences required for phage integration, since it contains 
about 3 kilobases of C7 DNA. Thus, strains lacking the attB 
sequence might also hybridize with that probe. To resolve 
this ambiguity, we used an attP probe derived from <o phage, 
and thus free of non-attB sequences from C7 DNA, in 
hybridizations. The attP site of <o phage shares a 96-bp 
sequence with the attP site of p and y and the attBl and 
attB2 sites in C7. There is no other sequence homology 
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FIG. 4. Detection of attBl-attB2 homology in Corynebacterium 
species. Purified bacterial DNA digested with Bam HI was proc- 
essed as described in Fig. 2. Lanes: A, C. diphtheriae CI; B, C. 
ulcerans 712; C, C. pseudotuberculosis 769; D, C. renale 412; E, C. 
renale 412(RP28); F, C. xerosis S1001; G, C. pseudodiphtheriticum 
S1002; H, C. minutissimum S1003; I, "C. flavidum" S1017; J, C. 
fcovw S1020; K, C. /wo*// S1018; L, C. fef/ae S1023; M, C. 
glutamicum 13032; N, C. pseudodiphtheriticum S1041; O. C. earn 
S1046. 

between the a/// 1 probe and C7 twB region (G. Ratti, 
personal communication). BamHl digests of DNAs from five 
strains of C. diphtheriae, five of C. ulcerans, and two of C. 
pseudotuberculosis were hybridized with the to-detjved attP 
probe diagramed in Fig. 1. All 12 strains hybridized with the 
attP probe and at a similar intensity with the same fragments 
that hybridized with the attBl-attB2 probe (data not shown). 
In fact, all strains that were positive with the attBl-attB2 
probe and were tested were also positive with the attP probe 
(Table 1, footnote b). 

Presence of ottff-related sites in other Corynebacterium 
species. The presence of sites homologous to attBl-attB2 in 
other Corynebacterium species, as well as organisms in 
other genera, was assessed in hybridization experiments 
(Fig. 4; Table 1). All Corynebacterium species tested hybrid- 
ized with the probe. Some strains, including those of C. 
xerosis, C. minutissimum, " C '. flavidum" , C. bovis, and C. 
glutamicum, had two hybridizing BamHl fragments, and 
those of C. renale and C. pseudodiphtheriticum had one. All 
of these hybridized with an intensity similar to that seen with 
strains of C. ulcerans and C. pseudotuberculosis but not as 
strong as that seen with strains of C. diphtheriae. Strains of 
C. hoagii, C. betae, and C. equi had one fragment, and these 
produced weaker hybridizations than any of the other spe- 
cies. When the attP probe was used (data not shown), it 
hybridized in all cases with the same BamHl fragments as 
the attBl-attB2 probe, but with this probe all Corynebacte- 
rium spp., including C. diphtheriae, hybridized with the 
same intensity, except for C. hoagii, C. betae, and C. equi, 
which produced weaker hybridizations. In other experi- 
ments (data not shown), individual strains of Brevibacterium 
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epidermidis, " Propionibacterium aerobicum" , and five un- 
classified aerobic skin coryneforms exhibited one or more 
BamHl fragments that hybridized weakly with both the 
attBl-attB2 and attP probes. In probes with attBl-attB2 
alone, 10 strains of C.fascians hybridized weakly, whereas 
one strain of "C. genitalium" produced a strong reaction. 
However, strains of Listeria monocytogenes, Staphylococ- 
cus aureus, Clostridium perfringens, and Pseudomonas 
aeruginosa did not exhibit homology with either probe. 

Hybridization of Corynebacterium spp. with DNA adjacent 
to the attB site. The observation that all C. diphtheriae DNAs 
hybridized more intensely with the attBl-attB2 probe than 
did DNAs of other Corynebacterium species suggested that 
the DNA sequences flanking attB 1 , attB2, or both might be 
specific for C. diphtheriae. To examine this, an Hindi 
fragment internal to the probe and located between the two 
attB sites (Fig. 1) was isolated, labeled, and hybridized with 
BamHl digests of various Corynebacterium species (Fig. 5). 
This internal probe hybridized strongly to one of the attBl- 
o//B2-hybridizing fragments of the C. diphtheriae strains 
(lanes A to E) but only weakly to those of C. ulcerans and C. 
pseudotuberculosis (lanes F to K), the latter becoming 
visible only after exposure periods days longer than those 
required to produce strong positive results with strains of C. 
diphtheriae. There was no detectable hybridization with the 
DNAs of C. renale or C. glutamicum even after prolonged 
exposure. 

DISCUSSION 

We showed that sites with homology to the attB phage 
integration sites utilized by (3 and p-related cory- 
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FIG. 5. Hybridizations with DNA flanking the C7 attB sites. 
Purified bacterial DNA digested with BamHl was processed as 
described in the legend to Fig. 2. The probe was the Hindi fragment 
of A634 as described in the legend to Fig. 1. C. diphtheriae (lanes): 
A. C7; B, S1016; C, S1015; D, S601; E, 820. C. ulcerans (lanes): F, 
712; G, 740; H, 751; I, 872. C. pseudotuberculosis (lanes): J, 766; K, 
769. Lane L, C. renale 411. Lane M, C. glutamicum 13032. The C7 
fragment in lane A carried the attBl site. The DNA in lane D was 
partially degraded. 
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nebacteriophages in C. diphtheriae C7 were present in all of 
the Corynebacterium species and related organisms exam- 
ined. The organisms carrying these sites included human and 
other animal pathogens, members of the normal human flora, 
and organisms isolated from soil and plants. Some of these 
organisms are known hosts for phages of the p family, but in 
some this relationship has not been detected and for many 
has yet to be examined. Where tested and testable, specifi- 
cally in strains of C. diphtheriae, C. ulcerans, and C. 
pseudotuberculosis, a/f/Mreated sites were the site of inte- 
gration for one or more members of the p family. The single 
exception was the failure of tox° p-related phage 782 to 
integrate into afffl-related sites in two strains of C. ulcerans 
even though it did so in others. 

In contrast to the extensive information available on the 
requirements for the integration of lambda phage in E. coli 
(17), very little is known about the specificity requirements 
for the integration of members of the p phage family into 
Corynebacterium species. The data suggest that the att sites 
in both the phage and bacterium contain a common sequence 
of % bp, far larger than the 15-bp common core found in the 
lambda-E. coli system, though it is not known whether all of 
the 96 bp are essential to the integration process. In addition 
to a common core, integration of lambda depends on the 
presence of fairly large and specific flanking sequences 
outside the phage common core and, to a smaller extent, 
outside that of the host. The observations in this paper and 
those by Rappuoli et al. (13) on the integration of p-related 
phages and our own observations with phage 782, whose 
a/r/'-containing BamHl fragment showed no detectable ho- 
mology with that of p (3), suggest that very little phage 
sequence specificity may be required for integration outside 
the 96-bp sequence shared with the bacterial genome. In 
addition, the weaker hybridizations we obtained with the 
attBl-attB2 probe in species other than C. diphtheriae, in 
which p-related phages integrate, i.e., C. ulcerans and C. 
pseudotuberculosis, and their strong hybridization with the 
attP probe equivalent to that seen with C. diphtheriae, 
suggest that large flanking sequences on the bacterial chro- 
mosome are not required for integration either. Neverthe- 
less, much remains to be learned in detail about site-specific 
recombination in the corynebacterial system. With respect 
to the anomalous behavior of phage 782 noted above, it can 
be argued generally that mutation either in the common core 
sequence of the host or an unfulfilled specificity requirement 
of flanking sequences prevented integration in this case. 
However, that alone would not explain our failure to detect 
the actual site of phage 782 integration with the attP probe, 
assuming that it was similar to the primary site. Additional 
explanations would have to be devised to account for this 
discrepancy. It is worth noting that, though we previously 
failed to detect any homology between the aff/'-containing 
BamHl fragments of phage 782 and p phage (3), we were 
able to do so with the smaller, more specific attP probe used 
in the present study. 

The number of <wfl/-attB2-hybridizing fragments found in 
various species of Corynebacterium differs and is a function 
of the intrinsic number of attB sites that each strain carries, 
variation in the BamHl restriction sites which might prevent 
identification of two or more contiguous sites, and the 
presence of p-related phages. Integration of phage into an 
unoccupied site, in theory, yields two sites with the identical 
sequence. In all of the cases tested, we found that 
lysogenization into the attB site resulted in just such an 
increase in hybridizing sites. Strains of C. diphtheriae, 
including some not carrying p-related phages, had at least 



two sites, whereas most strains of C. ulcerans had only one 
site. It has been suggested (11, 14) that the presence of two 
sites in C. diphtheriae may have been due to duplication of 
the attB site and presumably enhanced the possibility of 
increased diphtheria toxin production by permitting integra- 
tion of converting phages at both sites. The observation that 
one of the BamHl fragments hybridizing with attBl-attB2 is 
the same size in all but one of the C. diphtheriae examined in 
this study, whereas the second varies in size, is still unex- 
plained. Integration at both C7 sites has been reported (13) 
and would be expected to produce size variation in both 
fragments. It is possible that in these strains phage was 
preferentially integrated into one site, thus producing varia- 
tion in the size of only one of the BamHl fragments. 

The most intriguing observation of all is the wide distri- 
bution of the af/fi-related sequence among Corynebacterium 
species and related organisms. It could be argued that the p 
family of phage is of ancient ancestry and has accompanied 
the coryneforms as they evolved from soil and plant species 
to species inhabiting man and other animals. This would 
explain the persistence of these sites but would only do so if 
they were still being used by phages carrying the homolo- 
gous attP sites. So far, p-related phages have only been 
identified in strains of C. diphtheriae, C. ulcerans, and C. 
pseudotuberculosis (4). Phages isolated from C. 
pseudodiphtheriticum are morphologically and antigenically 
distinct from p phages (15), and a C. renale phage, RP28, 
though morphologically similar to p (7) but lacking homology 
to p in DNA hybridization tests, did not integrate into the 
attB site present in the genome of its host (unpublished 
data). It is clear that much more will have to be done before 
the hypothesis of coevolution is adequately tested. Alterna- 
tively, it could be postulated that these sites were retained 
because they play some other role in the genetics, and hence 
evolution, of the corynebacteria, though there is no evidence 
in support of this hypothesis. Finally, it is of interest that, in 
three strains of C. ulcerans and one of C. pseudotuberculosis 
in which the tox gene is present in the absence of any other 
detectable p-related DNA (5), one of the BamHl fragments 
hybridizing with the attBl-attB2 probe also contains the tox 
gene. Whether this reflects the original relationship between 
tox and a phage integration site or a residue of some past 
interaction of the site with a phage has yet to be determined. 
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